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SEQUENCE LISTING 

4 (1) GENERAL INFORMATION: 

6 (i) APPLICANT: Hook, Magnus 

7 Patti, Joseph M. 

8 House- Pompeo, Karen 

9 Sthanam, Narayana 
10 Symersky, Jindrich 

12 (ii) TITLE OF INVENTION: COLLAGEN BINDING PROTEIN COMPOSITIONS 

13 AND METHODS OF USE 
15 (iii) NUMBER OF SEQUENCES: 8 

17 (iv) CORRESPONDENCE ADDRESS: 

18 (A) ADDRESSEE: Arnold, White & Durkee 

19 (B) STREET: P.O. Box 4433 
2 0 (C) CITY: Houston 

21 (D) STATE: Texas 

22 (E) COUNTRY: U.S. 

23 (F) ZIP: 77210 

25 (v) COMPUTER READABLE FORM: 

26 (A) MEDIUM TYPE: Floppy disk 

2 7 (B) COMPUTER: IBM PC compatible 

28 (C) OPERATING SYSTEM: PC -DOS/MS -DOS 

2 9 (D) SOFTWARE: Patentln Release #1.0, Version #1.30 

31 (vi) CURRENT APPLICATION DATA: 

C--> 32 (A) APPLICATION NUMBER: US/09/813 f 820 

C--> 33 (B) FILING DATE: 22-Mar-2001 [ 

34 (C) CLASSIFICATION: 

36 (vii) PRIOR APPLICATION DATA: 

37 (A) APPLICATION NUMBER: 08/856,253 

38 (B) FILING DATE: 

4 0 (viii) ATTORNEY/AGENT INFORMATION: 

41 (A) NAME: Kitchell, Barbara S. 

42 (B) REGISTRATION NUMBER: 33,928 

4 3 (C) REFERENCE/DOCKET NUMBER: TAMK:193 

4 5 (ix) TELECOMMUNICATION INFORMATION: 

46 (A) TELEPHONE: (512) 418-3000 

47 (B) TELEFAX: (512) 474-7577 
50 (2) INFORMATION FOR SEQ ID NO : 1: 

52 (i) SEQUENCE CHARACTERISTICS: 

53 (A) LENGTH: 441 base pairs 

54 (B) TYPE: nucleic acid 

55 (C) STRANDEDNESS : single 

56 (D) TOPOLOGY: linear 

58 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 

60 ATAACATCTG GGAATAAATC AACGAATGTT ACGGTTCATA AAAGTGAAGC GGGAACAAGT 60 

62 AGTGTTTTCT ATTATAAAAC GGGAGATATG CTACCAGAAG ATACGACACA TGTACGATGG 120 

64 TTTTTAAATA TTAACAATGA AAAAAGTTAT GTATCGAAAG ATATTACTAT AAAGGATCAG 180 

6 6 ATTCAAGGTG GACAGCAGTT AGATTTAAGC ACATTAAACA TTAATGTGAC AGGTACACAT 240 
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68 AGCAATTATT ATAGTGGACA AAGTGCAATT ACTGATTTTG AAAAAGCCTT TCCAGGTTCT 300 

70 AAAATAACTG TTGATAATAC GAAGAACACA ATTGATGTAA CAATTCCACA AGGCTATGGG 360 

72 TCATATAATA GTTTTTCAAT TAACTACAAA ACCAAAATTA CGAATGAACA G C AAAAAG AG 420 

74 TTTGTTAATA ATTCACAAGC T 441 

77 (2) INFORMATION FOR SEQ ID NO : 2 : 



79 (i) SEQUENCE CHARACTERISTICS: 

80 (A) LENGTH: 159 amino acids 

81 (B) TYPE: amino acid 

82 (C) STRANDEDNESS: 

83 (D) TOPOLOGY: linear 

85 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

87 Met Arg Gly Ser His His His His His His Gly Ser lie Thr Ser Gly 

88 1 5 10 15 

90 Asn Lys Ser Thr Asn Val Thr Val His Lys Ser Glu Ala Gly Thr Ser 

91 20 25 30 

93 Ser Val Phe Tyr Tyr Lys Thr Gly Asp Met Leu Pro Glu Asp Thr Thr 

94 35 40 45 

96 His Val Arg Trp Phe Leu Asn lie Asn Asn Glu Lys Ser Tyr Val Ser 

97 50 55 60 

99 Lys Asp lie Thr lie Lys Asp Gin lie Gin Gly Gly Gin Gin Leu Asp 

100 65 70 75 80 
1.02 Leu Ser Thr Leu Asn lie Asn Val Thr Gly Thr His Ser Asn Tyr Tyr 
103 85 90 95 

105 .Ser Gly Gin Ser Ala lie Thr Asp Phe Glu Lys Ala Phe Pro Gly Ser 

106 100 105 110 

108 Lys lie Thr Val Asp Asn Thr Lys Asn Thr lie Asp Val Thr lie Pro 

109 115 120 125 

111 Gin Gly Tyr Gly Ser Tyr Asn Ser Phe Ser lie Asn Tyr Lys Thr Lys 

112 130 135 140 

114 lie Thr Asn Glu Gin Gin Lys Glu Phe Val Asn Asn Ser Gin Ala 

115 145 150 155 
118 (2) INFORMATION FOR SEQ ID NO : 3 : 

12 0 (i) SEQUENCE CHARACTERISTICS: 

121 (A) LENGTH: 849 base pairs 

122 (B) TYPE: nucleic acid 

123 (C) STRANDEDNESS: single 

124 (D) TOPOLOGY: linear 

126 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 



128 GACGATAAAA ATGGAAAAAT ACAAAATGGT GACATGATTA AAGTGGCATG GCCGACAAGC 6 0 

130 GGTACAGTAA AGATAGAGGG TTATAGTAAA ACAGTACCAT TAACTGTTAA AGGTGAACAG 120 

132 GTGGGTCAAG CAGTTATTAC ACCAGACGGT GCAACAATTA CATTCAATGA TAAAGTAGAA 180 

134 AAATTAAGTG ATGTTTCGGG ATTTGCAGAA TTTGAAGTAC AAGGAAGAAA TTTAACGCAA 240 

136 ACAAATACTT CAGATGACAA AGTAGCTACG ATAACATCTG GGAATAAATC AACGAATGTT 300 

138 ACGGTTCATA AAAGTGAAGC GGGAACAAGT AGTGTTTTCT ATTATAAAAC GGGAGATATG 360 

14 0 CTACCAGAAG ATACGACACA TGTACGATGG TTTTTAAATA TTAACAATGA AAAAAGTTAT 420 

142 GTATCGAAAG ATATTACTAT AAAGGATCAG ATTCAAGGTG GACAGCAGTT AGATTTAAGC 480 

144 ACATTAAACA TTAATGTGAC AGGTACACAT AGCAATTATT ATAGTGGACA AAGTGCAATT 540 

14 6 ACTGATTTTG AAAAAGCCTT TCCAGGTTCT AAAATAACTG TTGATAATAC GAAGAACACA 600 ■* 

14 8 ATTGATGTAA CAATTCCACA AGGCTATGGG TCATATAATA GTTTTTCAAT TAACTACAAA 660 



file://C:\Crf3\Outhold\VsrI813820.htm 



12/17/01 



Page 3 of 7 



RAW SEQUENCE LISTING DATE: 12/17/2001 

PATENT APPLICATION: US/09/813,820 TIME: 10:19:16 

Input Set : N:\Crf3\RULE60\09813820.txt 
Output Set: N:\CRF3\12172001\I813820.raw 



150 ACCAAAATTA CGAATGAACA GCAAAAAGAG TTTGTTAATA ATTCACAAGC TTGGTATCAA 720 
152 GAGCATGGTA AGGAAGAAGT GAACGGGAAA TCATTTAATC ATACTGTGCA CAATATTAAT 780 
154 GCTAATGCCG GTATTGAAGG TACTGTAAAA GGTGAATTAA AAGTTTTAAA ACAGGATAAA 84 0 

156 GATACCAAG 84 9 

159 (2) INFORMATION FOR SEQ ID NO: 4: 

161 (i) SEQUENCE CHARACTERISTICS: 

162 (A) LENGTH: 211 amino acids 

163 (B) TYPE: amino acid 

164 (C) STRANDEDNESS: 

165 (D) TOPOLOGY: linear 

167 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4: 

169 Met Arg Gly Ser His His His His His His Gly Ser Asp Asp Lys Val 

170 15 10 15 

172 Ala Thr lie Thr Ser Gly Asn Lys Ser Thr Asn Val Thr Val His Lys 

173 20 25 30 

175 Ser Glu Ala Gly Thr Ser Ser Val Phe Tyr Tyr Lys Thr Gly Asp Met 

176 35 40 45 

178 Leu Pro Glu Asp Thr Thr His Val Arg Trp Phe Leu Asn lie Asn Asn 

179 50 55 60 

181 Glu Lys Ser Tyr Val Ser Lys Asp lie Thr lie Lys Asp Gin lie Gin 

182 65 70 75 80 

184 Gly Gly Gin Gin Leu Asp Leu Ser Thr Leu Asn lie Asn Val Thr Gly 

185 85 90 95 

187 Thr His Ser Asn Tyr Tyr Ser Gly Gin Ser Ala lie Thr Asp Phe Glu 

188 100 105 110 

190 Lys Ala Phe Pro Gly Ser Lys lie Thr Val Asp Asn Thr Lys Asn Thr 

191 115 120 125 

193 lie Asp Val Thr lie Pro Gin Gly Tyr Gly Ser Tyr Asn Ser Phe Ser 

194 130 135 140 

196 lie Asn Tyr Lys Thr Lys lie Thr Asn Glu Gin Gin Lys Glu Phe Val 

197 145 150 155 160 

199 Asn Asn Ser Gin Ala Trp Tyr Gin Glu His Gly Lys Glu Glu Val Asn 

200 165 170 175 

202 Gly Lys Ser Phe Asn His Thr Val His Asn lie Asn Ala Asn Ala Gly 

203 180 185 190 

205 lie Glu Gly Thr Val Lys Gly Glu Leu Lys Val Leu Lys Gin Asp Lys 

206 195 200 205 

208 Asp Thr Lys 

209 210 

212 (2) INFORMATION FOR SEQ ID NO : 5: 

214 (i) SEQUENCE CHARACTERISTICS: 

215 (A) LENGTH: 1500 base pairs 

216 (B) TYPE: nucleic acid 

217 (C) STRANDEDNESS: single 

218 (D) TOPOLOGY: linear 

220 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5: 

222 GCACGAGATA TTTCATCAAC GAATGTTACA GATTTAACTG TATCACCGTC TAAGATAGAA 60 
224 GATGGTGGTA AAACGACAGT AAAAATGACG TTCGACGATA AAAATGGAAA AATACAAAAT 120 
226 GGTGACATGA TTAAAGTGGC ATGGCCGACA AGCGGTACAG TAAAGATAGA GGGTTATAGT 180 



file://C:\Crf3\Outhold\VsrI8 1 3820.htm 



12/17/01 



Page 4 of 7 




RAW SEQUENCE LISTING DATE: 12/17/2001 

PATENT APPLICATION: US/09/813,820 TIME: 10:19:16 

Input Set : N:\Crf3\RULE60\09813820.txt 
Output Set: N:\CRF3\12172001\I813820.raw 

228 AAAAC AG T AC CATTAACTGT TAAAGGTGAA CAGGTGGGTC AAGCAGTTAT TACACCAGAC 24 0 

230 GGTGCAACAA TTACATTCAA TGATAAAGTA GAAAAATTAA GTGATGTTTC GGGATTTGCA 300 

232 GAATTTGAAG TACAAGGAAG AAATTTAACG CAAACAAATA CTTCAGATGA CAAAGTAGCT 360 

234 ACGATAACAT CTGGGAATAA ATCAACGAAT GTTACGGTTC ATAAAAGTGA AGCGGGAACA 420 

236 AGTAGTGTTT TCTATTATAA AACGGGAGAT ATGCTACCAG AAGATACGAC ACATGTACGA 480 

238 TGGTTTTTAA ATATTAACAA TGAAAAAAGT TATGTATCGA AAGATATTAC TATAAAGGAT 54 0 

24 0 CAGATTCAAG GTGGACAGCA GTTAGATTTA AGCACATTAA ACATTAATGT GACAGGTACA 6 00 

242 CATAGCAATT ATTATAGTGG ACAAAGTGCA ATTACTGATT TTGAAAAAGC CTTTCCAGGT 660 

244 TCTAAAATAA CTGTTGATAA TACGAAGAAC ACAATTGATG TAACAATTCC ACAAGGCTAT 72 0 

24 6 GGGTCATATA ATAGTTTTTC AATTAACTAC AAAACCAAAA TTACGAATGA ACAGCAAAAA 780 

248 GAGTTTGTTA ATAATTCACA AGCTTGGTAT CAAGAGCATG GTAAGGAAGA AGTGAACGGG 840 

250 AAATCATTTA ATCATACTGT GCACAATATT AATGCTAATG CCGGTATTGA AGGTACTGTA 900 

252 AAAGGTGAAT TAAAAGTTTT AAAACAGGAT AAAGATACCA AGGCTCCTAT AGCTAATGTA 960 

254 AAATTTAAAC TTTCTAAAAA AGATGGATCA GTTGTAAAGG ACAATCAAAA AGAAATTGAG 1020 

256 ATTATAACAG ATGCAAACGG TATTGCTAAT ATTAAAGCGT TGCCTAGTGG AGACTATATT 1080 

258 TTAAAAGAAA TAGAGGCGCC ACGACCGTAT ACATTTGATA AGGATAAAGA ATATCCGTTT 114 0 

260 ACTATGAAAG ATACAGATAA TCAGGGATAT TTTACGACTA TTGAAAATGC AAAAG CG AT A 1200 

262 GAAAAAACAA AAGATGTTTC TGCTCAAAAG GTTTGGGAAG GCACTCAAAA AGTGAAACCA 126 0 

26 4 ACGATTTATT TCAAGTTGTA CAAACAAGAT GACAATCAAA ATACAACACC AGTAGACAAA 132 0 

26 6 GCAGAGATTA AAAAATTAGA AGATGGAACG ACAAAAGTGA CATGGTCTAA TCTTCCGGAA 1380 

26 8 AATGACAAAA ATGGCAAGGC TATTAAATAT TTAGTTAAAG AAGTAAATGC TCAAGGTGAA 144 0 
270 GATACAACAC CAGAAGGATA TACTAAAAAA GAAAATGGTT TAGTGGTTAC TAATACTGAA 1500 

27 3 (2) INFORMATION FOR SEQ ID NO: 6: 



27 5 (i) SEQUENCE CHARACTERISTICS: 

276 . (A) LENGTH: 512 amino acids 

277 (B) TYPE: amino acid 

278 (C) STRANDEDNESS : 

279 (D) TOPOLOGY: linear 

281 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

283 Met Arg Gly Ser His His His His His His Gly Ser Ala Arg Asp lie 

284 15 10 15 

286 Ser Ser Thr Asn Val Thr Asp Leu Thr Val Ser Pro Ser Lys lie Glu 

287 20 25 30 

289 Asp Gly Gly Lys Thr Thr Val Lys Met Thr Phe Asp Asp Lys Asn Gly 

290 35 40 45 

292. Lys lie Gin Asn Gly Asp Met lie Lys Val Ala Trp Pro Thr Ser Gly 

293 50 55 60 

295 Thr Val Lys lie Glu Gly Tyr Ser Lys Thr Val Pro Leu Thr Val Lys 

296 65 70 75 80 

298 Gly Glu Gin Val Gly Gin Ala Val He Thr Pro Asp Gly Ala Thr He 

299 85 90 95 

301 Thr Phe Asn Asp Lys Val Glu Lys Leu Ser Asp Val Ser Gly Phe Ala 

302 100 105 110 

304 Glu Phe Glu Val Gin Gly Arg Asn Leu Thr Gin Thr Asn Thr Ser Asp 

305 115 120 125 

307 Asp Lys Val Ala Thr He Thr Ser Gly Asn Lys Ser Thr Asn Val Thr 

308 130 135 140 

310 Val His Lys Ser Glu Ala Gly Thr Ser Ser Val Phe Tyr Tyr Lys Thr 

311 145 150 155 160 
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380 (2) INFORMATION FOR SEQ ID NO : 7: 

382 (i) SEQUENCE CHARACTERISTICS: 

383 (A) LENGTH: 34 5 amino acids 

384 (B) TYPE: amino acid 

385 (C) STRANDEDNESS : 
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